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Q , One obtains Bell's inequalities if one posits a hypothetical joint probability 

distribution, or measure, whose marginals yield the probabilities produced 
by the spin measurements in question. The existence of a joint measure 
is in turn equivalent to a certain causality condition known as "screen- 
ing off". We show that if one assumes, more generally, a joint quantal 
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00 . measure, or "decoherence functional", one obtains instead an analogous 

^ \ inequality weaker by a factor of \/2. The proof of this "Tsirel'son inequal- 

ly-^ \ ity" is geometrical and rests on the possibility of associating a Hilbert 

O ■ space to any strongly positive quantal measure. These results lead both 

Q . to a question: "Does a joint measure follow from some quantal analog of 
'screening off' ?" , and to the observation that non-contextual hidden vari- 

Q^l ables are viable in histories-based quantum mechanics, even if they are 

^ excluded classically. 



" Le Moyne College, Syracuse New York 13214, USA, and Hamilton College, Clinton New 
York 13323, USA. 

^ Blackett Laboratory, Imperial College, London SW7 2AZ, UK, and Perimeter Institute, 
Waterloo, Ontario N2L 2Y5, Canada. 

Institute for Theoretical Physics, University of Utrecht, Minnaert Building, Leuvenlaan 
4, 3584 CE Utrecht, The Netherlands 

Department of Physics, Hamilton College, Clinton New York 13323, USA. 

^ Blackett Laboratory, Imperial College, London SW7 2AZ, UK. 

f Perimeter Institute, Waterloo, Ontario N2L 2Y5, Canada, and Department of Physics, 
Syracuse University, Syracuse New York 13244, USA. 



1 



I. Introduction 



Thinking of an experiment designed to test the BeU inequalities, we might picture to our- 
selves a source emitting a pair of silver atoms with correlated spins, and downstream, two 
Stern-Gerlach analyzers in spacelike separated regions, A and B. For each setting of the 
two analyzers one would obtain a set of 2 x 2 = 4 experimental probabilities (frequen- 
cies) corresponding to the four possible combinations of spin-up-or-down. By differently 
orienting one or both of the analyzers, one could similarly produce further sets of four 
experimental probabilities. A collection of probabilities obtained in this way, we will refer 
to as a system of experimental probabilities. The Bell inequality [1] (or more precisely its 
offspring, the Clauser-Horne-Shimony- Holt-Bell (CHSHB) inequality [2] [3]) pertains to 
such a system of experimental probabilities in the special case obtained by limiting each 
analyzer to only two possible settings (say a and a' for the ^-analyzer, and b and b' for 
the S-analyzer). 

Via a derivation that we recall below, the CHSHB inequality follows almost imme- 
diately from an assumption which we will express by saying that the given system of 
experimental probabilities admits a joint probability distribution. To clarify what this 
means, notice that, a priori, one has (with two settings each for the analyzers) four en- 
tirely distinct probability distributions, each living in its own four-element sample space 

= X ^^/3, where a ranges over the settings a or a' of A, (3 ranges over the settings 
6 or 6' of S, and each space Oq,, is a binary sample space, corresponding to the two 
possibilities, spin- up/spin-down. To say that these probabilities admit a joint distribution 
means that one can merge the ilap into a single sample space 

n = X Via' X X n^/ (1) 

of 2^ = 16 elements, and that one can define on f2 a (not necessarily unique) probability 
distribution from which, for example, the probabilities for Vlah = x 0^ follow on summing 
over the possible a' and b' outcomes. * That is, the separate distributions on the spaces 0,^(3 

* That there are 16 experimental probabilities and 16 joint probabilities is merely a co- 
incidence. The two numbers would differ if we generalized to particles of higher spin or 
considered more than two settings per analyzer. 
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can be recovered as marginals from a single probability distribution on the joint sample 
space f2. 

In effect one is assigning a meaning to the so called "counterfactual" question, "What 
would I find if I could observe all four spin axes a, a', b and b' at once?" And — crucially 
— one is assuming that the distributions for the Qa/3 (induced as marginals from the joint 
distribution on Q) are merely "revealed" but not altered by the particular way in which 
the analyzers are set, the "context" of the observation. For this reason, the assumption 
of a joint probability distribution is often alternatively described as the assumption of 
non-contextual "hidden variables" [4] , and the violation of the CHSHB inequality is then 
described as an experimental refutation of such hidden variables theories. It is also de- 
scribed as a refutation of "local causality" because a condition of that type implies the 
existence of non-contextual hidden variables. 

Thus far, however, the implicit context of our discussion has been entirely classical, and 
one may wonder to what extent the relationships we have just reviewed carry over to the 
quantum case. It might seem that this question is ill posed, for the lack of a quantal analog 
of the notion of joint probability distribution. However, if one views quantum mechanics 
from a "histories" standpoint, then it is natural to regard it as a kind of generalized theory 
of probability or measure (probability being realized mathematically in terms of the concept 
of measure). Indeed, one can delineate a hierarchy of such generalized measure theories [5] 
in which classical stochastic theories comprise the first level of the hierarchy and unitary 
quantum theories — suitably interpreted — are included in the second level. (See also [6].) 

Within this second level, the level of "quantal measures" or "decoherence functionals" 
(we use the terms interchangeably), one has a notion of joint quantal measure, in direct 
analogy to the notion of joint probability distribution. We will see that just as the as- 
sumption that the experimental probabilities admit a joint classical measure leads to the 
CHSHB inequality, so the assumption that they admit a joint quantal measure leads, al- 
most as directly, to an analogous but weaker constraint known as the Tsirel'son inequality 
[7]. The main result of this paper, then, is that the latter inequality can be understood as 
a direct analog of the CHSHB inequality if one adopts a histories formulation of quantum 
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mechanics. Such a formulation also leads to a geometrical proof of the inequality that, we 
believe, has some independent interest in its own right. ^ 

Of course, the connection between the CHSHB inequality and the existence of a joint 
measure is far from the whole story in the classical case, because the strongest support 
for the latter assumption usually comes from considerations of causality and/or locality. 
Of particular importance in this connection is the condition known variously as "local 
causality" [10], "stochastic Einstein locality" [11], or "classical screening off " . 

The condition of "screening off" on the classical measure asserts that events in causally 
unrelated regions of spacetime become independent (are "screened off" from one another) 
when one conditions on a complete specification of the history in their mutual causal past. 
As shown by Fine [4], the derivation of the CHSHB inequality from screening off can 
be viewed as a two-step process. First one goes from screening off to the existence of a 
joint probability distribution, and then from the latter to the inequality. (The converse 
implications are also valid [4].) The violation in nature of the CHSHB inequality is thus 
also a violation of classical screening off. Usually this is described as a "failure of locality" , 
but because screening off is above all a condition of relativistic causality, it might be 
more appropriate to rather characterize violation of the CHSHB inequality as a "failure of 
(classical) causality" . 

Should quantum mechanics, then, be thought of as nonlocal, acausal, or both — or 
is there a sense in which it is neither if seen from an appropriate vantage point? We 
would have liked, in the present paper, to provide such a vantage point by showing that 
the classical threefold equivalence among screening off, the existence of a joint probability 
measure, and the CHSHB inequality reproduces itself at a higher level (namely level two) as 

^ A related result, which identifies "quantum Bell inequalities" which are necessary condi- 
tions for a set of two-qubit states to be the reduced states of a mixed state of three qubits, 
appears in [8]. 

Not all authors distinguish between these concepts, but we try to do so consistently 
here, cf. [9] . By locality we mean the failure of physical influences to "jump over regions of 
spacetime", and by causality (in the sense of relativistic causality) we mean their failure 
to act outside the future light cone. For example, a theory containing "tachyons" might 
be local without being causal. 
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a relationship among quantal screening off, the existence of a joint decoherence functional, 
and the Tsirel'son inequality (which of course is not violated by quantum mechanics). 

The proof that a joint quantal measure implies the Tsirel'son inequality accomplishes 
this in part, but we are unable to complete the story in all generality because we lack 
a fully convincing formulation of quantal screening off. Nevertheless, we will suggest a 
candidate condition that closely resembles its classical analog, and that is formally valid in 
relativistic quantum field theory. We will be able to prove that any system of experimental 
probabilities that admits a joint decoherence functional also admits a model which obeys 
this screening off condition; but the converse eludes us, and so we cannot yet assert that 
screening off is fully equivalent to a joint measure in the quantal case. We will show, how- 
ever, that a causality assumption inherent in standard unitary quantum theory, namely the 
commuting of spacelike separated operators, does imply the existence of a joint measure. 
This provides a kind of converse and shows in particular how our proof of the Tsirel'son 
inequality can be founded on a recognizable causality condition. 

II. Quantum mechanics as quantum measure theory 

We briefly summarize the hierarchy of generalized measure theories described in more 
detail in [5] [12] [13]. 

In a generalized measure theory, there is a sample space Q, of possibilities for the system 
in question. Normally these are to be thought of as "fine grained histories" , meaning as 
complete a description of physical reality as is conceivable in the theory, e.g. for n-particle 
mechanics a history would be a set of n trajectories, and for a scalar field theory, a history 
would be a field configuration on spacetime. Predictions about the system — the dynamical 
content of the theory — are to be gleaned, in some way or another, from a (generalized) 
measure /x on O (strictly, on some suitable class of "measurable" subsets of fi, but we will 
gloss over this technicality here). 

Given /i (a non- negative real- valued set function), we can construct the following series 
of symmetric set functions: 

/i(x)^M^) 

hiX, Y) = fi{X UY)- fi{X) - fi{Y) 
h{X, y, Z) = ii{X UYUZ)- p,{X UY)- ii{Y UZ)- ii{Z UX) + ii{X) + ii(Y) + ii{Z) 
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and so on, where X, Y, Z, etc. are disjoint subsets of Q, as indicated by the symbol 'U' 
for disjoint union. 

A measure theory of level k is one which satisfies the sum rule Ik+i =0. It is 
known that this condition implies that all higher sum rules are automatically satisfied, 
viz. Ik+n = for all n > 1. A level 1 theory is thus one in which the measure satisfies 
the usual Kolmogorov sum rules of classical probability theory, classical Brownian motion 
being a good example. A level 2 theory is one in which the Kolmogorov sum rules may be 
violated but Is is nevertheless zero. Unitary quantum mechanics satisfies this condition 
and is an example of a level 2 theory — which we dub therefore "quantum measure theory" 
in general. 

The existence of a normalized quantum measure on Q is equivalent to the existence 
of a decoherence functional D{X; Y) of pairs of subsets of Q satisfying: * 

(i) Hermiticity: D{X;Y) = D{Y;Xy , yX,Y; 

(ii) Additivity: D{X UY;Z) = D{X; Z) + D{Y; Z) , VX, F, Z with X and Y disjoint; 

(iii) Positivity: D{X;X) > , VX; 

(iv) Normalization: -0(0; fi) = 1 . 

The relationship between the quantal measure and the decoherence functional is 

^i{X) = D{X■X). (2) 

Unless otherwise stated, we will always assume that D satisfies in addition to (iii) the 
condition of strong positivity, which states that for any finite collection of (not necessarily 
disjoint) subsets Xi,X2, ...X^ of fi, the nx n Hermitian matrix Mij = D{Xi;Xj) is 
positive semidefinite (it has no negative expectation values). The decoherence functional of 

* The quantity D{X;Y) is interpretable as the quantum interference between two sets 
of histories in the case when they are disjoint. Notice from (2) that fj, determines only 
the real part of D. (The imaginary part of D infiuences how smaller systems combine to 
form bigger ones. It also may affect the consistency /decoherence conditions one wishes to 
impose. These issues will be discussed in greater depth elsewhere.) 
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ordinary unitary quantum mechanics, for example, is strongly positive. Strong positivity is 
a powerful requirement because it implies in general that there is a Hilbert space associated 
with the quantum measure, which turns out to be the standard Hilbert space in the case 
of unitary quantum mechanics [14] [15]. Decoherence functionals which merely satisfy 
condition (iii) above are termed "weakly positive". 

In this paper, we will not enter into the general question of how to interpret the 
quantum measure. One set of ideas for doing so goes by the name of "consistent histories" 
or "decoherent histories" and attempts in effect to reduce the quantal measure to a classical 
one by the imposition of decoherence conditions [16] [17] [18] [19]. A different attempt at 
an interpretation, based on the notions of "preclusion" and correlation, may be found 
in [12]. For our purposes in this paper, it will suffice to assume, where macroscopic 
measuring instruments are concerned, that distinct "pointer readings" do not interfere 
(they "decohere" ) , and that their measures can be interpreted as probabilities in the sense 
of frequencies. 

In the sequel, we adopt a usage that seems particularly suitable for a histories-based 
measure theory. We use the terminology "an event in spacetime region ^" to refer to a 
subset a CQ such that the criterion which determines whether or not a history 7 belongs 
to a refers only to the properties of 7 within A (e.f/. if 7 is a field then its restriction to A 
is supposed to be enough information to determine whether 760).^ 

We will also assume that all of our sample spaces 0, are finite, so that integrals may be 
written as sums. Among other things, this lets us avoid the main technical complications 
in the definition of conditional probability. 

^ The term "event" is standard in probability theory for a subset of O. A subset of histories 
defined by some common property is termed a "coarse grained history" in the standard 
parlance of consistent histories quantum theory. In this language, an event in A is therefore 
a coarse grained history defined by a coarse graining according to properties local to A. 
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III. Two inequalities, classical and quantal 



Classical case 

We rehearse the proof of the CHSHB inequahty at level one in the hierarchy of generalized 
measures — i.e. at the classical level. The experimental context will be that described in 
the Introduction. In formalising it, however, one faces a choice. Namely, one must decide 
whether or not to include random variables corresponding to the instrument settings in the 
analysis. If one excludes such variables, then one need deal only with the sample spaces 
described in the Introduction: the four spaces the four spaces and Qp, and the 
joint sample space O. (Recall that in our notation, a — a or a' , and (3 = b or b' , the in- 
strument settings in regions A and B.) On the other hand, if one includes the instrument 
settings as variables, then one necessarily deals with a larger sample space Q. We have 
chosen to follow the second approach (which arguably is "more fully intrinsic" , in keeping 
with the philosophy of generalized quantum mechanics as a theory of closed systems) , and 
consequently our discussion will attribute probabilities not only to the possible outcomes 
with a given experimental arrangement, but also to the possible experimental arrange- 
ments themselves. Nevertheless, the following may also be read consistently as if the first, 
more "minimalist" approach had been adopted, since, mutatis mutandis, the proofs take 
the same form in both cases. (One who feels uncomfortable attributing quantitative prob- 
abilities to instrument settings may thus refrain from doing so.) The essential difference 
between the two approaches is that in the "minimalist" reading, conditional probabilities 
like Proh{outcome\setting) must be understood as primitive objects; they cannot be re- 
solved into ratios of conditional probabilities like Proh{outcome n setting) /Pmh{setting) . 

Consider a sample space, Q, of histories defined on a "substratum" possessing a back- 
ground causal structure (a spacetime, for example, or a causal set). Let A and B be two 
spacelike separated regions of the substratum and denote their causal pasts by J~ (A) and 
J~{B) (where J~{A) contains A itself). We are interested in the usual EPRB setup in 
which there is a range of possible choices (to be made "essentially freely") of settings of 
some experimental apparatus in A and similarly in B. For example, this range might be 
the possible directions of the magnetic field in a Stern-Gerlach apparatus for spin mea- 
surements. For each setting in A, the outcome of the measurement is either or —1. In 
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the standard example this would be the measured value of the spin (multiplied by in 
the set direction. 



Let Ma denote the set of possible settings of the experimental apparatus in A. (As 
mentioned in the Introduction, we limit ourselves to two settings, Ma = {a, a'}.) Each 
element of Ma is, in our technical sense, an event in A (or more generally in J~{A) fl 
J~{By, where the superscript c denotes complementation), namely, that subset of 0, 
containing those histories in which the corresponding experimental setting is made. The 
elements of Ma are disjoint. For each element, a e Ma, let a±i denote the possible 
outcomes of the measurement with setting a, so that a = q;+iUq;_i where a+i (oi-i) is the 
set of histories in which outcome +1 (—1) obtains. Similarly, Mb is the set of two possible 
S-measurements, {6,6'}; each element of Mb is an event in B (or J~{B)r]J~{A)^); and 
for each (3 G Mb, /3±i 8i.re the possible outcomes of the measurement, with (3 — (3^iL\ /3-i. 

The "law of motion" of the underlying stochastic process is assumed to be given by a 
classical probability measure /i on Q, and an expression like //(ajfl/Jj jan/?) will denote the 
probability of outcomes ctj and Pj, conditional on the settings being a and Since we are 
imagining all our sample spaces as finite, a conditional probability tJ>{x\y) = fi{xr\y) / fi{y) is 
only really meaningful when fx{y) > 0. In the contrary case, one might define it to be zero, 
since ii{y) = =^ n^xHy) = (albeit not when is quantal!), but for present purposes, it 
will prove more convenient to adopt the convention that iJ,{x\y) is simply undefined when 
IJ,{y) vanishes. 

Let be the sixteen-element sample space (1) labelled by the (16 possible values of 
the) quadruple of binary variables {ai,a^,,bj,bj,), each of which takes values ±1. (For 
brevity we will write (oj, a'-,, bj, b'^,) = (ii' j j') where there is no risk of confusion.) In the 
Introduction, we called the sixteen numbers //(ajflfej |an6), ^(a^, nbj\a'nb), //(a^n^^, |an6'), 
//(a-, n6^, |a' n6'), a "system of experimental probabilities" , and we agreed to say that these 
numbers admit a joint probability distribution if and only if there exists a classical measure, 
/i on n, such that 



and similarly for every other {a, (3) pair. It is now easy to prove the CHSHB inequality. 




(3) 



i'j' 



9 



Theorem 1 Let and /i be as described above and assume that the resulting system 
of experimental probabilities admits a joint probability distribution p, on Q satisfying the 
condition (3) on its marginals. Define the correlation functions 

X{a,P) = ^^i ■ j ■ li{airi PjlaCi P) (4) 

for q; = a, a', /3 = 6, b'. Then 

I X{a, b) + X{a', b) + X{a, b') - X{a', b')\<2 . (5) 

(The pattern is three plus signs and a minus. It doesn't matter where one puts the minus 
sign.) 

Proof It suffices to prove the inequality without the absolute value signs, as one sees 
by reversing the signs of the 5-outcomes. By assumption there exists a measure on the 
sample space Q. of quadruples (ii'jj') whose marginals agree with fi on each (a, j3) pair. 
Therefore 

X(a, b) = i ■ j ■ fJ-icii nbj \ar\b) 

ii'jj' 

with similar formulas for X{a', 6), X(a, b') and X(a', b'). But for any of the possible values 
of we have ij + i'j + ij' — i'j' — {i + + {i — < 2, since one of the two 

parentheses must vanish in every case. The weighted average with respect to of this 
combination of i's and j's is therefore also less than or equal to 2; hence 

X{a, b) + X{a\ b) + X(a, b') - X{a', b') = Y. + ''^ + ' ^'■?") /^W/) ^ 2 . 

ii'jj' 

QED 

Quantal case 

At level two we have the same setup as before: a sample space Q including setting events 
a, a', b, b', etc.; and we use the same notation, in particular a = a or a' and (3 — b oi b'. 
But now we have on 17 a quantal measure /i and the associated decoherence functional D. 
We are considering the situation in which the events ctj fl correspond to the readings 



10 



(and settings) of macroscopic instruments, and so, as announced earlier, we will assume 
that the quantal measure fj, of any one of these macroscopic "instrument events" can be 
interpreted as an experimental probability (i.e. a frequency). Having done so, we can form 
conditional probabilities in the standard manner, as illustrated by the definition of the 
p{ai,(3j) in equation (7) below. The correlators X{a,(3) are then definable exactly as in 
the classical case [equation (4)]. (Notice that we have not attempted to extend the notion 
of conditional probability outside the setting of classical (level 1) measure theory. To our 
knowledge, there is, unfortunately, no established notion of "conditional quantal measure" 
or "conditional dccoherence functional", of which classical conditional probability would 
be a special case.) 

Notice that the identification of IJ^{Y) — D{Y;Y) as a probability- gna-frequency is 
only consistent over the whole algebra of instrument events Y if we assume that neither 
distinct instrument settings nor distinct outcomes for given settings interfere with one 
another. * In other words, we must assume that 



Va, and we also assume that all remaining such off-diagonal values of D vanish, for 
example, D{ai fl bj ; fl bi) = 0. 

Definition We denote as the experimental probabilities the sixteen numbers. 



The need for a quantal generalization of conditional probability arises in the following 
only because the experimental probabilities we work with are conditioned on specific in- 
strument settings. For present purposes, it thus would not arise at all in the alternative, 
"minimalist approach" mentioned earlier. However, even in such a framework, the need 
would return as soon as one had to condition on the specific results of observations or other 
processes. 

* This consistency condition is what gives the "consistent histories" interpretation its 
name. But there, it is raised to the level of a principle. 



D{ai n (3j ; ak n (3i) = /i(ai fl (3j)Sikdji , 



(6) 



p{ai, Pj) = jiioLi n (5j\a n (5) 



^{aj n (3j) 
n{a n (3) 



(7) 
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Definition The experimental probabilities p{ai,Pj) admit a joint quantal measure iff 
there exists a decoherence functional ID on Q such that its marginals agree with (7) for 
each of the four (a,/?) pairs {i.e. for each of the four possible instrument settings): 

Dab{ij;kl)= ^ D{ii'jj' ; kk'W) =p{ai,bj)5ik5ji {8ab) 

i'j'k'l' 

for {a, pi) = (a, b); 

Da'bii'j; k'l) = J2 Diii'jj' ; kk'W) = p{a[, ,bj)5vu'h {8a'b) 

ij'kV 

for (cu,/?) = (a', 6); and similarly for {a, 13) = {a,b') and {a,P) = {a',b'). 

Remark Our use of the word "marginals" here is in obvious analogy to its use in classical 
measure theory, where, given that O = Oi x O2, a marginal probability distribution on 
O2 is one induced from O by summing over Oi. Similarly here, the joint decoherence 
functional D on induces marginal decoherence functionals D^p (and hence marginal 
quantal measures jtta^) on all the {a,P) pairs, as illustrated in (Sab) and {8a'b). 

Observe that the matching-conditions (8) on the marginals require more than just 
agreement with the 16 probabilities p{ai,Pj). They also entail the vanishing of the 24 
off-diagonal elements Dc^piij] kl) with i ^ k or j ^ I. Notice on the other hand, that they 
do not refer to any marginals that would involve interference between distinct instrument 
settings. 

We will see that the Tsirel'son inequality follows from the existence of such a joint 
quantal measure. However, in order to demonstrate this, we will need to apply to a 
certain basic construction via which any strongly positive decoherence functional gives rise 
to a Hilbert space [14] [15]. 

Hubert space from (strongly positive) quantal measure 

Consider the vector space ^1 which consists of all formal linear combinations of the sixteen 
four-bit strings, {ii' j j'), i,i',j,j' = ±1. Let [ii'jj'] denote a general basis vector of 
That D is strongly positive means that it induces a (possibly degenerate) Hermitian inner 
product on i^i given by: 

{[ii'jj'i[kk'll']) = D{ii'jf;kk'll') . 
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In general, i^i is not a Hilbert space because it contains vectors with zero norm. To form 
a true Hilbert space take the quotient of by the vector subspace i^o of zero norm 
states: = S)i/S)o. Denote by the vector in 9) that corresponds to [ii'jj'] G i^i. 

(Regarding members of !^i/S)o as equivalence classes, we can describe as the set of 

vectors in i^i that differ from [ii' jj'] by vectors of zero norm.) Plainly, the vectors 
span i^; and we have*" 

D{ii' jj'-kk'll') = {ii'jj'\kk'll') . (9) 

This relationship will let us convert the correlators X{a, (3) into inner products of vectors 
in i3, the key step in our proof of Theorem 2. 

Correlators as inner products in Hilbert space: proof of Theorem 2 

In this subsection, we state and prove our main result as a theorem. We assume that 
the experimental probabilities admit a joint quantal measure given by the decoherence 
functional D on f], as specified in equations (8), and we denote by p, the corresponding 
generalized measure given by the diagonal elements of D, as in equation (2). 

Lemma 3.1 Let \a) e S) he defined by 

\a) = E ^ (10) 

ii'jj' 

and similarly for |a'), \b) and \b'). Then {a\a) = {b\b) = {a'\a') = {b'\b') = 1, and for any 
of the four possible pairings of a = a, a' with P = b,b' , we have 

{a\P) = X{a,p) = Y^i-3-p{ai,(5j) (11) 

Proof We give the proof for a = a, /3 = 6, the other three cases being strictly analogous. 
Using equations (8), (9) and (10), we replace the sum over diagonal terms in X{a,b) by 

^ For related observations see [20] [21] [22]. 
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the full sum: 



X{a,b)=J2i 

ij 


■j 


■p{ai,bj) 


Z J 


■I' 


■p{ai, bj)SikSji 


ijkl 








■I 


■ Dab{ij;kl) 


ijkl 








■I 


■ J2 D{n'jj'-kk'll') 


ijkl 




i'j'k'V 



ii'jj'kk'W 



= i ■ I ■ {ii' jj'\kk'll') 

ii'jj'kk'W 

={a\b) . 

We must also prove that the vectors \a), \(3) have unit norm. Let us prove for example 
that {a\a) — 1. To that end, define the vectors 

|a±) = E I ± 1^'^'^") 

i'jj' 

and note that (a + |a— ) = by (8a6) with i — +1, k = —1. Then 

\a) = \a+) - |a-) 

and we have 

(a|a) ={a+\a+) + (a — |a— ) — (a+|a— ) — (a— |a+) 
= (a+|a+) + (a-|a-) + {a+\a—) + {a—\a+) 

This last line is D(Jl; fi), which is 1 by our assumption of normalization. QED 

We are now ready to prove our main result, that any set of experimental probabilities 
which admits a joint quantal measure must respect the Tsirel'son inequality: 

Theorem 2 If there exists a strongly positive joint decoherence functional D onil whose 
marginals agree with D — meaning equations (8) hold — then 

I X(a, b) + X(a', b) + X(a, b') - X{a' , b') \ < 2^2 . (12) 
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Remark Instead of saying that the marginals "agree with D", we could equally well 
have said that they "are diagonal and yield the experimental probabilities p{ai,Pj)" . This 
expresses the theorem in a more self-contained form. 

Proof As before, it suffices to prove (12) without the absolute value signs. Write 

Q = X{a, h) + X(a', h) + X(a, h') - X{a' , h') (13) 

which has a "logical" maximum value of 4. By the previous lemma, we have 

Q = {a\b) + {a'\b) + {a\b') - {a'\b') 

(14) 

= {{a\ + {a'\)\b) + i{a\-{a'\)\b') . 

Since \b) and \b') are unit vectors, Q is maximized when \b) is parallel to \a) + \a') and \b') 
is parallel to |a) — \a'). Hence 

Q< |||a) + |a')|| + |||a)-|a')|| , 

whence Q < 2\/2 by the following simple lemma. QED 

Lemma 3.2 If u and v are vectors of unit length then | |w. + f 1 1 + | — f 1 1 < VS = 2y/2. 

Proof Let S = Utt + vH + Hw — v|| and write ^ = Re{u\v). Then ||tt±v|p = {u±v\u±v) = 
(1 + 1 ± 20 = 2 ± 2^. Hence 

S'^ = \ \u + vW"^ + \ \u — v\\'^ + 2\\u — v\ \ \ \u + 



= (2 + 20 + (2 - 20 + 2V(2 + 20(2-20 



= 4 + 2^/4 - 4^2 

< 4 + 2v/4 = 8 



QED 



An example: saturating the bound 



To illustrate some of the above, consider the familiar quantum mechanical setup leading to 
maximal violation of Bell's inequalities (5), which is known to produce a system of exper- 
imental probabilities that saturates the Tsirel'son bound (12). The mathematics involved 
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in this situation produces a joint decoherence functional that also is on the boundary of 
the convex set of strongly positive decoherence functionals. We can use this to conclude 
that by itself, weak positivity of the quantal measure {i.e. the condition D{X; X) > on 
the decoherence functional) is insufficient to imply the bound (12). 

We assume we have two spin-half particles in a singlet state and each particle heads 
off to either region A or region B where Alya and Bai, respectively, are waiting to make 
measurements on the particles. Alya sets her apparatus to measure the spin in directions 
a or a' and Bai in directions b or b' where a, a', b and b' are now unit vectors in three 
dimensional space satisfying 

aa' = bb' = 

a b = a b' = a' b = —a' • b' = — = . 

It is interesting that when calculating the quantity Q as given by ordinary quantum 
mechanics in this setup we obtain 

X{a, b) + X{a', b) + X{a, b') - X{a', b') 
= {a + a') ■b+{a- a') • b', 

which is exactly the same expression (14) as arose in the general proof of (12), only here 
we have ordinary vectors in instead of vectors in Hilbert space. 

In the EPRB setup we have a 4-dimensional Hilbert space which is a tensor product 
of two qubit Hilbert spaces and i^g, and lip) = (| '\)a\ Db — \ Da] '\)b)/V^ is the 
singlet state. On Sja we have Pauli matrices a and on we have Pauli matrices p, from 
which we can form projection operators. 

We will form the decoherence functional from the expectation value in the singlet 
state of strings of projectors onto the several values of the two spins in the four directions, 
a, a', 6, b'. Specifically, let us form the decoherence functional using the strings of projec- 
tors appropriate to the results: "Alya finds the spin to be ih/2 in the a direction and then 
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i'Ti/2 in the a' direction, and Bai finds jTi/2 in the b direction and then j'fi/2 in the b' 
direction." For this we need the projectors 

P^^\{l+jb.p) 
Pf,^\{l+j'b'-p). 

From these and the initial singlet state jV') we can construct the following decoherence 
functional that is strongly positive and decoheres on all (a, /3) pairs: 

D{ii'jj' ; kk'lV) = {MP^P^'PtPiPf'P^P?'Pni^) ■ (15) 



(This is just a decoherence functional in the sense of [19], evaluated on the coarse-grained 

oa -pa' -ph pb' 



histories represented by P^Pp PjPji , with initial state 



The decoherence functional of (15) will do the job, but there's a nicer, more symmetric 
form that will also work, where the order of the a and a' measurements is symmetrized 
and similarly for b and b': 



Dsym{}i jj '■> kk II ) 

1 / / / / / / / / (16) 

=YQ'^i^\^PkPk' +p^'P^){p!'Pi' +Pt'Pi)iP!Pr +p^'P^){p:p? +P?Pm)- 

Some simple cr-matrix algebra, using (cr + p)\'4^) = and ('0|<t|'0) = 0, because is the 
singlet, gives 

256 D,y^{ii'jj'; kk'lV) 
= {l + ik + i'k'){l+jl+j'l') 

+iik'-i'k)ijl'-j'l) 
-l=(i + k){j + l+j' + l') 

+^{i' + k'){j + l-j'-l'). 

One can easily verify that \Q\ = 2y/2 with these numbers. 
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The 16 X 16 matrix Dgym has 12 zero eigenvalues and so the Hilbert space that one 
constructs from it is four-dimensional, as one would expect for a pair of spin-| particles 
[15]. The existence of null directions also means that fl is verging on violating strong 
positivity, the matrix D being only positive semz-definite. 

Realizing the joint sample space 

The construction we have just employed can be made more vivid by relating it to a 
gedankenexperiment in which the 16 "outcomes" comprising the sample space correspond 
to actual trajectories of physical particles. The expression (15) can be interpreted in terms 
of Stern- Gerlach devices for silver atoms (or perhaps more conveniently, in terms of photon 
trajectories and interferometers, cf. the setup in [23].) An outcome of a spin measurement 
then amounts to a silver atom's emerging in either the upper or lower beam. However, 
suppose that we don't "look at" the silver atom, but instead send it through a reversed 
magnetic field designed to recombine the two beams, as if they had never been split apart 
at all. We can then pass it through a second Stern-Gerlach analyzer which again splits the 
beam into two, etc. If we concatenate two analyzers this way in region A, and two more 
in region B, then we naturally partition the full history space into 16 subsets depending 
on which beams the silver atoms traverse in their respective analyzers. In this way the 
elements of Q are realized as actual sets of histories that all pertain to a single experimental 
setup. It follows that any histories formulation* must induce in this manner a quantal 
measure on O; and the matrix-element (15) that we wrote down before is just the algebraic 
expression of this measure. 

Of course, the mere fact that the measure is well defined does not yet tell us what 
will happen if we do "look at" the particles. To make contact with the experimental prob- 
abilities p{ai, Pj), we must assume further that if we do choose to look, then the measure 
induced thereby on "us" directly reflects the measure fi on the space Cl of "microscopic" 

* Any formulation, that is, for which the silver atoms are part of the kinematics (or "ontol- 
ogy") and trace out continuous worldlines in spacetime. With discontinuous trajectories, 
the silver atoms might be present in both beams, and an event like "silver atom in upper 
beam in first analyzer" would not be well defined. It seems that something like this would 
actually occur in models such as that of [24] . 
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alternatives. That is, we must assume that "looking" at a beam merely reveals the corre- 
sponding value of fl. (Note in this connection that events in distinct spatial locations at a 
given time always decohere in unitary quantum mechanics.) 

IV. Relation to the "screening off" causality condition 

Classical case 

The condition of screening off on the classical measure fi asserts that events in causally 
unrelated regions A, B of spacetime become independent ("screened off" from one another) 
when one conditions on a complete specification c of the history in the region C = J~ {A) fl 
J~{B), the mutual causal past of ^ and B. The logic underlying this condition is that any 
correlation between spacelike separated variables must arise entirely from their separate 
correlations with some "common cause" in their mutual past, and therefore must disappear 
once full information about the past is given. 

Specialized to our situation, this screening off condition yields for all a e Ma, P £ Mb 
and i,j = ±1, 

li{ai n I3j\c) = ii(ai\c) ii{Pj\c) , (18) 

where c is any subset of O defined by a completely fine grained specification of the history 
in C = J~{A) n J~{B). Similarly (or just by summing (18) on we have 

//(a n /3|c) = //(q;|c)//(/3|c), (19) 

so that the "setting event" a is screened off from the setting event Dividing (18) by 
(19) yields an equation which, we claim, can be written as 

li{ai n PjlaCi pCic) = ii{ai\a n c)ii{f3j\f3 n c) . (20) 

t This is a strong form of the screening off condition, as it excludes in particular "primordial 
correlations". A less restrictive condition, depending on the context, would locate c in the 
union of the (exclusive) pasts of A and B. For details see [9]. 

^ This is not the only way to construe the "principle of common cause" , but it is the one 
adopted in all discussions of the Bell inequalities known to us. 
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This follows from noting that 

//(cti n /3j n a n /3 n c) 



IJ,(ai f] Pj\ar\ P f] c) = 



i^{a n /3 n c) 
//(ofi n (3j n c) 
/i(Q! n /3 n c) 
//(cKj n Pj n c)/ fi{c) 
ij,{a n p n c) / ii{c) 
li{ai n/?j|c) 



[since ctj C a, /3j C P] 



IJ,{a n /?|c) 

and 

IJ,(ai n a n c) 

uictj q: n c) = z r — 

' ^ //(anc) 

_ //(ftj n c) 

n{a n c) 
_ njaj n c)/n{c) 

n{a n c)/iJ,{c) 
_ iJ.{ai\c) 

fx{a\c) ' 

and similarly for n{Pj\P Pi c). [In these calculations, one is in effect "conditioning in 
stages" and recognizing that fi{{x\y)\z) = fi{x\yr\z) , where fi{{x\y)\z) := fi{xr\y\z)/ ii{y\z).] 
Observe that, in order for the conditional probabilities appearing in (18)-(20) to be defined, 
none of the measures //(c), //(a n c), //(/3 n c), //(a H P He) can vanish. Accordingly, (20) 
is only valid with this reservation. 

At this point, we need to formalize the idea that the instrument settings are "chosen 
freely". To that end, we will assume that, with respect to the measure /U, and for all 
a e Ma and P e Mb, the "setting events" a and P are independent of any* events in C. 
In the presence of screening off, this implies that the event "a and P" is also independent 
of any event in C: 

ii{anpnc) = ii{anp)iJi{c) . (21) 



* This is a rather drastic form of setting-independence. It would have been possible to 
include other events in the past on which the settings depended without afi^ecting the main 
points of the argument, as in [10]. 
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(It also implies that iJ,{a fl /?) = iJ,{a) iJ,{P) , so that, in the presence of screening off, the 
setting events are strictly independent of one another.) The formal derivation of (21) goes 
as follows. Our assumption of "setting-independence" says that 

IJ,{a\c) = iJ,{a) (and similarly for /?) . (22) 

Putting this together with (19), we obtain //(an/Jjc) = ii{a\c)ii{P\c) = fi{a)fj,{P), whence 
li{a n /? n c) = fi{a)ii{P)iJ,{c), whence fj,{a fl /5) = fi{a)fi{P) by summing on c. Comparing 
the first and last equations yields iJ,{a fl /3|c) = //(a fl /?), which is (21). Note finally that 
we can assume without loss of generality that //(c) > for all fine-grained specifications c 
of C (otherwise simply omit c from f2). Then, having just demonstrated that //(a fl /? fl 
c) = fj,{a)fj,{P)iJ,{c), we conclude that fj,{a fl /? fl c) never vanishes (unless we can't do the 
experiment at all!); hence equation (20) becomes valid unreservedly. 

It is well known that the screening off condition leads to the CHSHB inequality [10]. 
This follows from a result of Fine [4] according to which the existence of a joint distri- 
bution on f2 is equivalent''' to screening off. More formally, let us say that a system 
of experimental probabilities p{ai,Pj) admits a classical screening off model if one can 
find a sample space Q and a measure fj, thereon obeying (18) and (21), and such that 
IJ,{ai D (3j\a Ci /3) = p{ai, I3j). Then 

Lemma 4.1 A system of experimental probabilities admits a classical screening off model 
if and only if it admits a joint probability distribution Jl. 

Fine's treatment appears to rely tacitly on the "non-contextuality" assumption that 
settings of the remote instrument cannot affect local results. The condition that he in- 
vokes is not actually screening off as such, but what he calls "factorizability" , a con- 
dition which, as he words it, seems to be ambiguous between two formulations, the 
first of which (corresponding to our equation (20)) could be written in our notation 
as iJ,{ai n /3j\a n (3 Ci c) = ii{ai\a fl c)n{(3j\(3 fl c), and the second of which would be 
IJ,{ai n I3j\a n (3 n c) = n{ai\a H (3 H c)n{(3j\a H (3 H c). In these expressions, however, 
"conditioning" on a (for example) merely means that the instrument at A is set to a. 
Fine avoids attributing probabilities to instrument settings, in contrast to the approach 
we have adopted in this paper; his is the "minimalist approach" mooted at the beginning 
of Section III. 
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Proof (1) Let /x) be a screening off model for some system of experimental probabil- 
ities. We must demonstrate that there exists a classical measure, ju on f2, such that 

IJ,{air)bj\ar)b) = ^fl{ii' jj') 

i'j' 

and similarly for each (a, /3) pair. In the following, recall that c ranges over all subsets of 
O specified by a complete fine-grained description of J~{A) D J^{B) for which yu(c) > 0. 
Recall also that we have assumed that Q has finite cardinality. 

Since, by our assumptions, //(aflc) never vanishes, fi{ai | aflc) is defined, and we have 

^fiitti I an c) = 1 , (23) 

i 

and similarly for a', b, b'. Now make the "maximal independence ansatz", 

A*(^^'i/) = ^l^{ai\aric) ii{a'^, I a' n c) ii{bj \ br\c) ii{bj, \b' Cic) ii{c) . (24) 

c 

We claim that the marginals of agree with fx for each (a, 0) pair. For example, take 
a = a, P = b; then 

i'j' 

= 'Yn{ai\ar\c)ii{bj\br\c)n{c) [by (23)] 

c 

= Yli{ainbj\anbnc)ii{c) [by (20)] 

c 

~ ^ iiianbnc) ' 

/ d / xV (c) by (21) 
^ /x(an%(c) ^ L ^ ^ 

J2K<^i ^ n c) 

c 

n{a n 6) 
_ ;u(aj n bj) 
li{a n 6) 
= iJ,{ai n 6j|a n 6) . 
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(2) Conversely, suppose there exists a classical measure ju on f2 as described above with 
marginals that agree with /x on each (a, pi) pair. A consistent screening off model can be 
found by supposing that there were other events in the past which were not taken into 
account in the original sample space Q.. 

We will assume for the purposes of this proof that the original sample space Q, contains 
nothing but the experimental events of interest, i.e. the settings and outcomes, and in 
particular contains no events in the mutual past of A and B. It would have been possible 
to include past events of which all the experimental probabilities in VL were independent; 
no essentially new idea is needed to extend the proofs to this case, but they are excluded 
here for the sake of clarity. 

Let be a new sample space whose fine-grained histories are those of Q, with an 
additional quadruple of binary variables which we will regard as residing in the mutual 
past of A and B. (These variables play the role of the past "causes" of the experimental 
outcomes.) We claim that there exists a classical measure Ji onO, such that the measure 
Ji agrees with ji on all of the experimental events — and that Ji satisfies screening off. To 
demonstrate this, let us set, formally. 



and declare by fiat that the quadruple of binary variables lives in the mutual 

past of A and B. Then any subset of 0, can be considered a subset of Q in the obvious way. 
We write {ii'jj'} or {kk'W} for the set of all histories in D, with those particular values of 
the quadruple in the past. 

The statement of screening off for this new model is 



We must find a Jl which extends and for which (25) holds. Note that the experimental 
settings are still required to be independent of all past events, which now means {kk'W}. 
We make the ansatz 



il = {{h,i,i',jj')\he n and hi' J J' e {+1,-1}} 



]l{ai n (3j I {kk'W}) = ]l{ai I {kk'W})il{/3j \ {kk'W}) . 



(25) 



li{ai n bj n {kk'W}) = ii(a n b) Ji(kk'W) SikSji 
]l{a'i, n bj n {kk'W}) = ii{a' n b) li{kk'W) Si'k'Sji 



(26) 
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and similarly for the other two {a, P) pairs, (a, 6') and {a',b'). 

Summing one example of (26) over k, k', I, V yields, with the help of (3), 

/i(ai n 6j) = //(a n b)y^^p,{ii'jj') = iJ,{ai Dbj) . 

i'j' 

This shows that the probabilities of the experimental outcomes and settings are the same 
for fi and Jl. 

As required, the settings are also independent of the added past variables with respect 
to /i, for example: 

//(a n 6 n {kk'W}) = Jl{ai n bj n {kk'll'}) 

= ii{a n b) il{kk'll') 

= Ji{ar\b)Ji{{kk'll'}). 

The ansatz (26) also gives, after simple manipulations, 

/I(aj n bj I {kk'U'}) = SikSji]j,{a fl b) 

Jl{a^ I {kk'W}) = 5ikjl{a) 
llibj I {kk'll'}) = Sjiil{b) , 

which implies (25), so the new measure Ji satisfies screening off. QED 

The significance of the second part of the lemma is that, given the existence of the 
joint probability measure p, on Q, the observed experimental probabilities can always be 
explained classically and causally, in a suitably chosen model. (In the proof of this part 
of the lemma, the underlying idea is almost trivial, despite the somewhat complicated 
notation that expresses it in this case: if the past determines the future, then any two 
future events become independent when the past is conditioned upon. Screening off is 
thus automatic in any "deterministic" situation. The same basic fact persists for quantal 
measures and will underlie our proof of Lemma 4.2 in the next subsection (where the 
notational complications are even greater).) 

Corollary Screening off the CHSHB inequality. 
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Remark By conditioning on a given instrumental setup (a, /?), we obtain from the overall 
measure fx a probability measure on the space Waf3 = x fl^ x where Qc is the space 
of all configurations or "partial histories" in the past region C. In this way, we obtain four 
distinct "measure spaces" , where by this phrase we simply mean a sample-space endowed 
with a measure. The proof of part (1) of the lemma in effect "patches" these four measure 
spaces together into a single measure space W = fla x ^a' x Qb x ^b' x Oc, in a manner 
reminiscent of a fibre product, with Qc playing the role of common base space. The 
ansatz (24) then produces Q (with /2) as the marginal measure space resulting from W by 
neglecting C. 

An alternative to screening off? 

This might be an appropriate place to comment on the possibility of a different deriva- 
tion of the CHSHB inequality, in which an enhanced locality condition does some of the 
work done by screening off in the proof of Lemma 4.1. Can one, in fact, demonstrate the 
existence of a joint decoherence functional without invoking screening off as such? This 
is an interesting question because it is perhaps not settled that screening off is the true 
expression of relativistic causality, even in the classical case [9] [25] . Here, then, is such an 
alternative derivation (albeit not as precisely formulated). 

We start from the assumption that instruments at A (resp. B) respond only to certain 
local variables ("beables") (resp. ^b) defined in A (resp. B). The perfect correlations 
that arise in the singlet state then imply that these local variables determine the response 
unambiguously, without any stochastic component (this being the EPR observation); and 
we may assume that this is always so, even in examples such as that of [6], where the 
correlations are not perfect. On this basis, we immediately acquire our sample space Q, 
parameterized by the values of the local variables ^a,b- 

We also need a matching probability measure fi on fl. For this, we must assume that 
the choice of instrument setting at ^ — including the choice of no measurement at all — 
cannot influence local variables at B, and vice versa for settings at B. It follows that we get 
a well defined (setting independent) probability distribution on the variables ^a, Cb, and 
this induces a probability measure on Q, whose marginals are obviously the experimental 
probabilities, p{ai,bj), etc. 
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We see that screening off as such was not used. In its place was the assumption that 
instrument settings do not influence the "hidden variables" and ^b- We tacitly assumed 
as well, of course, that the variables ^ do not influence the instrument settings, i.e. that 
the latter were "free" in relation to this particular set of microscopic variables. Notice also 
that the derivation in this form did not require us to attribute probabilities to instrument 
settings, except insofar as this would be one way to make precise their "freedom" in respect 
of the variables ^. 

In the derivation just described, the "local beables" ^ provide a "material basis" for 
the sample space fl, in the sense that elements of fl represent equivalence classes of histories 
determined by the values of those beables. In contrast, the proof we gave earlier merely 
concocted a space Q (and a measure on it), without attempting to identify it with any 
actual set of physical histories (c/. the remarks under "Realizing the joint sample space", 
above.) 

Quantal case; Proposal for quantal screening off 

We have seen that classically, there is an equivalence between screening off and the existence 
of a joint probability measure on all of the outcomes under consideration, i. e. on Q. We 
would like to prove something similar in the quantum case: that a suitably generalized 
screening off condition is equivalent to the existence of a joint quantal measure on Q, all 
subject to appropriate conditions of setting independence and decoherence. Given our 
standing assumption of strong positivity, the Tsirel'son inequality would then follow from 
quantal screening off. 

We will propose a candidate for a condition of quantum screening off such that, if 
there exists a joint decoherence functional ID with the correct marginals, then a past can 
be cooked up, just as in the classical case, so that the resulting quantum measure Jl satisfies 
the proposed condition. The converse of this, that the candidate quantum screening off 
condition implies the existence of a strongly positive joint decoherence functional on O 
with the correct marginals remains conjectural for now. Wc will, however, prove that such 
a decoherence functional on O exists in the case of ordinary unitary quantum mechanics, 
and we will highlight the causality assumption that allows the construction. 
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To motivate the proposed quantal screening off condition, notice tliat tlie classical 
screening off condition (18) is equivalent to 

fi{ai n Pj n c)ii{c) = fi{ai n c)fi{Pj n c) . 

Tfie analogous condition on D is then our proposal for quantum screening off: 

D{ai n /3j n c ; afe n n c)D{c ; c) = D{ai n c ; afe n c)D{(3j n c ; n c) , (27) 

for all settings a, a and /3, /3, and for all fully specified pasts c, c [as in (18)]. More details 
and a proof that quantum field theory satisfies this condition formally will appear in a 
separate work [25]. (Conditions (27) include matrix elements that are off-diagonal in the 
instrument settings, a and /3. With the "minimalist approach", only the equalities with 
a = oi and (3 = (3 would be meaningful.) Notice that if D is completely diagonal, then (27) 
reduces to classical screening off. A variation on (27) asserts (in a shorthand notation) 
that^ 

D{ijc; klc) D{pqc; fsc) = D{pjc; Tic) D{iqc; ksc) (28) 

where c and c are as before, every other index stands for an event in region A or B, and 
indices appear in the order: A-event, S-event, mutual past. From (28) one can deduce 
that D decomposes as a product of the form 

D{ijc ■ klc) = F{ic ; kc) G{jc ; Ic) . (29) 

This formulation carries more information than (27) when D(c; c) = 0, which can happen 
non-trivially in the quantal case. 

We also take the quantum condition of setting independence to be 

D{a nj3nc; anpnc) = D{a ; a)D{l3 ; (3)D{c ; c) 

= l^{oi)ii{(3)D{c; c), 

where a e Ma, P £ Mb', c and c can be any two events in C; and we have assumed that 
D is diagonal in a and Finally, recall that all decoherence functionals are assumed by 
default to be strongly positive. 

The pattern might clarify as: D{ijk; lmn)D{i' j'k; I'm'n) = D{i'jk; l'mn)D{ij'k; Im'n) 
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In the next lemma, the augmented history space Q is the same space as appeared in 
the proof of Lemma 4.1, part 2. Also, of course, /j, is the quantal measure on fl and D its 
associated decoherence functional. 

Lemma 4.2 Let D be a decoherence functional on that decoheres on (a, /3) pairs 
and such that a and P are independent of everything else. Assume that the induced 
experimental probabilities p{ai, Pj) admit a joint quantal measure in the sense of the 
definition given above in equations (7) and (8). Then there exists a (strongly positive) 
decoherence functional D on 0, that agrees with D on all pairs of instrument events, and 
that satisfies quantum screening off. 

Proof As in the classical case, we assume that 11 contains no "irrelevant" events. We 
again concoct extra events {ii'jj'} in the region C that were not taken into account in Q. 
Our screening off condition for the new model based on Q is 



and similarly for the 3 other (a, (3) pairs, taking D to vanish when the instrument settings 
are off-diagonal e.g. 



D{a, n I3j n {kk'W} ; n /5„ n {pp'qq'}) D{{kk'll'} ; {pp'qq'}) 
= D{ain {kk'W} ; am n {pp'qq'}) D{Pj n {kk'W} ; P^ H {pp'qq'}) , 



(30) 



V a and (3. Define the decoherence functional D on by the equations. 



D(ai n bj n {kk'W} ; n 6n n {pp'qq'}) 

= SikSjiSmpSnq A*(o H 6) D{kk'W ; pp'qq') , 



(31) 



D{a'i, n bj n {kk'W} ; n 6„ n {pp'qq'}) = . 



From this, it can be seen that D{{kk'W} ; {pp'qq'}) = ]D{kk'W ; pp'qq'). 



Now, summing, for example, (31) over k, k', I, I' and p,p' , q, q' produces 




k'l'p'q' 



IJ,{ar)b)p{ai,bj)SimS. 

ll{ai n bj)S^mSjn 

D{ai n bj ; n bn) 



28 



using (8), (7) and (6). This shows that D takes the same values as D for all pairs of 
experimental settings and outcomes. 

As required, the setting-events are also independent of the added past variables with 
respect to D, for example: 

D{a nbn {kk'W} ; aHbn {pp'qq'}) = ^ D{ai n bj n {kk'll'] ; fl 6n n {pp'Qfl']) 

ijmn 

= n{anb)D{kk'U' ; pp'qq') 

= ll{anb)D{{kk'U'};{pp'qq'}) . 

The definition of D also gives 

D{ai n {kk'W} ; am n {pp'qq'}) = 6ikSmpli{a)D{kk'll' ; pp'qq') 
D{bj n {kk'W} ; bn n {pp'qq'}) = Sji6ngfl{b)D{kk'W ; pp'qq') , 

which implies (30) for a = a = a and (3 = (3 = b. Similar calculations can be done for 
every {a, (3) pair, and when the instrument settings are off-diagonal (30) holds trivially as 
both sides are zero. The new measure Ji thus satisfies quantum screening off. 

Finally, D is strongly positive because it is essentially just D which is strongly positive 
by assumption. 

QED 

We lack a proof of the converse of Lemma 4.2 (an analogue of part 1 of Lemma 4.1). 
But we can show that a strongly positive decoherence functional on VL with the correct 
marginals, and which decoheres on all (a, (5) pairs, exists in the case of unitary quantum 
mechanics (c/. our earlier discussion of concatenated Stern-Gerlach beam splitters with 
"recombiners" , which suggests more generally that Q, should be realizable in any "histories 
formulation" ) . 

In standard quantum mechanics, for any measurement a in A, there exist projection 
operators P", i = ±1, which project onto the subspaces of Hilbert space associated with 
the outcomes ±1 of the measurement. Plainly, P^^ -\- P^^ = 1. Similarly there exist 
operators P^ , j = ±1, projecting onto the subspaces of Hilbert space associated with 
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the outcomes ±1 of the measurement (3 in B. The standard causahty assumption is then 

Given this, it is easy to construct, analogously to (15), a joint decoherence functional 
on Q. with the desired properties. Let 

D{ii'jj'- kk'W) = TT{Pf,P^P^'P^poP^Pi,P^P^:) , (32) 

where po is the density matrix giving the pre-measurement state of the particles and the 
trace is over particle states. 

Lemma 4.3 D is strongly positive and has the correct marginals (8). 

Proof D has the canonical form of a decoherence functional of ordinary unitary quantum 
mechanics, which is known to be strongly positive [21]. To show that it has the correct 
marginals (8) on each (a,/?) pair, let us work out for example the case (a,/?) = (a, h): 

J2 Diii'jf ; kk'W) = TriP^PrpoP^Ph 

i'j'k'l' 

= Tr{PtpoPuP-) Sji 

= Tr{pQP^P^) 6ikSji = p{ai, hj) 6ikSji , 

using the cyclic property of the trace and -Pf -Pj* = Pj^jl middle line, and the posited 

commutativity of and Pj* in the last. QED 

Notice that D is only one of many decoherence functionals which satisfy the desired 
conditions. Instead of the product Pf P" , for example, we could have any convex combi- 
nation of P"P" and P" Pf . (It seems unlikely that the most general D can be obtained 
in this manner, though, because our ansatz here exhibits an extra decoherence not de- 
manded by the physics; for example, D{ij; k'l) := X^i'j'feZ' -^(^^'ii's kk'W) oc 5ji because 
Tr{P^P^PQP^',P^) oc P/Pf oc 5ji, even though a ^ o'.) 

Remark Even without commutativity, the above trace expressions would define deco- 
herence functionals D and Dctp for Q, and the and the marginals of D would still 
reproduce the Dap- In light of this, one might perceive the existence of a joint quantal 
measure as reflecting most directly the existence for the -Da/3 of trace expressions involving 
operators in a common Hilbert space (cf. [26]). The commutativity would manifest itself. 
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on this view, only in the fact that distinct a-outcomes continue to decohere independently 
of whether a /^-measurement is made, and independently of its outcome if it is. 

In the classical context, the existence of a joint probability distribution // on O is 
often described by saying that one can find non-contextual hidden variables capable of 
reproducing the given system of experimental probabilities. Adopting the same language, 
we can intepret Lemma 4.3 in the following manner: It is possible to attribute the correla- 
tions in the EPRB setup to non-contextual * hidden variables, so long as they are quantal 
hidden variables, governed by a decoherence functional rather than a classical probability 
distribution. 

V. Weak positivity is not enough 

We have seen that the condition of strong positivity leads to the Tsirel'son inequality. Can 
the inequality be violated by a decoherence functional that is only weakly positive (but 

otherwise observes the conditions of theorem 2)? That this is so can be seen simply by 
noting the continuity of Q in equation (13), and checking that Dsym in equation (17) is 
not on the boundary of weak positivity — meaning that it assigns no set a measure of 
exactly zero. We have verified this for all of the 2^^ — 1 non-empty subsets of Q. 

* By "non-contextual" we refer to the fact that the quantal measure ju on f2 is defined 
independently of any measuring instruments or their settings. In this sense, one can say 
that a given measurement (if suitably designed) "merely reveals" a particular value of 
/i, without participating in its definition. (In saying this, we are not asserting that, in 
any individual instance, the measurement "merely reveals", for example, the location of 
the silver atom without affecting it. This would be a much stronger claim, and possibly 
meaningless in a non-deterministic theory which provides no account of "what would have 
happened" in any individual instance, had the measurement not taken place.) 

^ That a non-contextual quantal measure fl exists where a non-contextual classical measure 
cannot, implies that the corresponding decoherence functional D fails to be diagonal; for 
a decoherence functional is classical if it is diagonal. And indeed the D constructed above 
in the case of unitary quantum theory is not easily seen to possess off-diagonal matrix 
elements. In the framework of the "consistent histories" point of view this means that the 
coarse-grained histories specified by (oj, a'^,, hj, b'j,) fail to decohere, and it is consequently 
not possible to assign probabilities simultaneously to all of the "quantal hidden variables" . 
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In fact, one might go further and ask whether the maximum possible value of Q = 4 
can be attained by a weakly positive decoherence functional. The answer is yes, and there 
exist remarkably simple examples. Here are the elements of one such example obtained by 
the lp_solve linear programming solver [27]. 

D{ ; ) = D{+ -+-; + - +-) = D{+ --+; + - -+) 

= 5(- + -+;- + -+) = ^ 

-D{ +; ) = -D{+ -+-; + ) = D{- + -+;- + — ) 

= D{ +; + + — ) = -D{+ --+; + + — ) 

= -D{- + -+; + + ) = D{+ --+; + - +-) 

= -D{+ - -+; +) = -D{- + -+; +) 

= 5(- + -+; + --+) = ^ 

The remaining elements which are not equal to one of the above by Hermiticity are zero. 
For this decoherence functional, one checks that 

X{a, h) - X(a , h) + X(a, h') + X(a', h') = 4 

For consistency with theorem 2, strong positivity must be violated by any D which 
violates the Tsirel'son bound. One can check that the above D does so, with four negative 
signs, four positive signs, and eight zeros in its signature. 

In the context of the Bell inequalities, then, the strong positivity condition of quan- 
tum measure theory shows itself to be much stronger than the weak one. To the extent 
that weak positivity is physically acceptable, one can imagine a generalized form of quan- 
tum mechanics (a generalized measure theory remaining at level two) which affords the 
maximum possible violation of the CHSHB inequality. Strong positivity, in contrast, is as 
restrictive as ordinary quantum mechanics in this respect. 

One other feature of the above matrix D seems worthy of notice here. All the marginals 
of the form /2(a±), //(a^j_), etc. take the value 1/2, which is recognizable as the only "causal" 
value. That D yields Q = 4 implies perfect correlations (or anti-correlations) between A 
and S, and any other marginals than 1/2 would let Alya signal to Bai by manipulating 
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the settings of her analyzer. But this could not happen with the above D because "non- 
signaling" is built into the requirements we have imposed on it in equations (8), which 
imply directly that ^jP{ai, bj) = p{ai, b^,). 

VI. Conclusion 

One can view quantum mechanics as a dynamical schema that generalizes the classical 
theory of stochastic processes in such a manner as to take into account interference be- 
tween pairs of alternatives. Within the framework appropriate to such a view — that of 
"quantal measure theory" — we have sought quantal analogs of some of the relationships 
that emerge in connection with correlated pairs of spin-| particles when one contemplates 
tracing their behavior to the dynamics of some underlying stochastic, but still classical, 
variables ("hidden variables"). One knows that classically, the existence of a joint prob- 
ability measure on the space of experimental outcomes is equivalent on one hand to the 
CHSHB inequality, and on the other hand to screening off. (This equivalence shows that 
the existence of hidden variables is intimately linked to causality.) Quantally, one might 
desire an analogous set of equivalences relating (1) the existence of a joint decoherence 
functional on the space of experimental outcomes; (2) Tsirel'son's inequality; and (3) 
some quantal causality condition generalizing classical screening off. We have shown — 
assuming strong positivity of the decoherence functional — that (1) implies (2), and that 
(1) also implies (3) if the latter is represented by the candidate condition (27). A proof of 
the converse, that (3) implies (1), would greatly strengthen the links with causality. We 
did not provide such a proof in general, but we did show that (1) follows from standard, 
unitary quantum mechanics with spacelike commutativity. 

It is perhaps worth emphasizing that, just as the CHSHB inequality follows from 
the exceedingly general assumption of the existence of a joint probability distribution 
on Q (in effect, a probability distribution for non-contextual hidden variables), making 
no statements concerning the nature of the classical dynamics save that it is given by 
a probability measure on a suitable history-space, so also the Tsirel'son inequality is a 
consequence only of the bilinear (level 2) structure of quantum theory. We have seen 
in fact that it follows from the mere existence of a (strongly positive) joint decoherence 
functional, without making any assumption that the latter has the form taken by ordinary 
unitary quantum mechanics. The inequality, is in this sense a statement concerning the 
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predictive structure of quantum mechanics itself, rather than anything to do with any- 
specific dynamical law. 

If strong positivity is discarded, we can violate the Tsirel'son inequality with a quantal 
{i.e. level two) measure, and we have even seen that the "logical" bound of 4 for the 
quantity Q of equation (13) can be achieved then. The corresponding non-local correlations 
are of interest in information theory, since they would allow certain communication tasks to 
be performed with fewer classical bits transferred than are demanded in standard quantum 
mechanics [28]. Quantal measure theory, or equivalently generalised quantum mechanics, 
provides for such correlations, but only if strong positivity is relaxed to weak positivity. 
Whether this is physically appropriate is doubtful, however. Apart from the Hilbert space 
constructions that it affords, a compelling physical motivation for strong positivity concerns 
the composition of non-interacting sub-systems [29] [15]: Strong positivity is preserved 
under such composition whereas weak positivity is not. ^ (Might this difference lead to 
experimental tests that could distinguish between the two types of positivity?) 

For higher level measures, we speculate that imposing an analog of strong positivity 
would lead to higher level inequalities still weaker than (12), but it is beyond our current 
powers to pursue this idea since, beyond level two, we lack the analog of the decoherence 
functional, in terms of which an extension of the strong positivity condition could be 
framed. 

Let us accept provisionally that the existence of a joint decoherence functional is 
a necessary condition for relativistic causality. Then we can claim the following: If an 
EPRB-type experimental setup is ever found to violate the Tsirel'son inequality, then all 
causal theories in the framework of generalised quantum mechanics with a strongly positive 
decoherence functional are contradicted. However, as long as no super luminal signaling is 
seen, such an experimental result would not rule out causal generalised quantum theories 
altogether, if one were willing to accept that the world may be described by decoherence 
functionals that are not strongly positive. Another alternative would be to generalize to 

^ This fact is closely related to the fact that tensor products of so called completely positive 
maps are also completely positive. 
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a higher order measure, in which case the challenge would be to develop good dynamical 
models within this at present loosely constrained class of theories. 

A convincing quantal analog of the screening off condition would have an interest going 
far beyond its relevance to experiments of the EPRB type. In connection with quantum 
gravity, the condition of "Bell causality" was the guide that led to the family of (classical) 
dynamical laws derived in [30] for causal sets. Screening off as such lacks a clear meaning 
against the backdrop of a dynamical causal structure, but Bell causality is perhaps as close 
as one could have come to it in the causal set context. For this reason, among others, it 
seems clear that progress in identifying the correct quantal analog of classical screening 
off would help point the way to a causality principle suitable for the needs of quantum 
gravity. 

The existence of a joint probability measure for our 16-element sample space can be 
interpreted as the necessary and sufficient condition for the existence of "hidden variables" 
which determine "non-contextually" the measurement outcomes. That Bell's inequality is 
violated in nature tells us that no such hidden variables are possible classically. Not so in 
quantal measure theory, however, and we described a model in which the "quantal hidden 
variables" could be identified concretely with particle worldlines. Our main finished result 
in this paper was that the existence of such variables can be seen as the reason for the 
Tsirel'son inequalities. However, non-contextuality is only part of the story. Whether such 
variables can be "causal" as well as "non-contextual" is a question whose answer awaits a 
better understanding of the concept of "quantal screening off" . 
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